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@ Amino acid sequences and polypeptides including these sequences and having the specificity of foot and mouth disease 
and other viral antigens. 



@ An amino acid sequence or a synthetic polypeptide 
consisting of the same and preferably containing a helical 
structure is capable of being recognised by two or more 
amisera raised against different viral strains by virtue of the 
rotationaliy and transiationatly related different aspects 
presented by the helix to the antibodies. In particular, amino 
acid sequences of this invention may have rotationaliy 
related ephopic sites recognisable by different strains of foot 
and mouth disease virus. 
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y AMINO ACID SEQUENCES AND POLYPEPTIDES INCLUDING THESE 
SEQUENCES AND HAVING THE SPECIFICITY OF FOOT AND MOUTH 
DISEASE AND OTHER VIRAL ANTIGENS. 

This invention relates to amino acid, sequences, to ^ 
5 polypeptides consisting of the same and to processes for 
producing these sequences and polypeptides, which have 
the specificity of two or more antigens of certain 
viruses and, in particular, of foot and mouth disease 
viral antigens. 

10 The natural iiomune system is based upon the well 

known antigen-antibody reaction', in which an antigen 
(which is foreign to the host body - an example is a 
virus) stimulates the production by the host of 
antibodies specific to the antigen. These antigen- 

15 specific antibodies react with the antigen and. in a 
manner which is still being investigated, neutralize 
it. The antibodies so biosynthesized recognize specific 
"surface" features of the molecular structure of the 
antigen. 

20 Antibodies to. proteins and polypeptides can be 

conveniently classified into "conformational" and 
"sequential" types, according to whether they recognise 
the relatively static conformation of an antigenic 
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determinant region, or an inherently more flexible 
sequence of amino acid residues. Most short, free 
polypeptides tend to lack any secondary structure 
sufficient to give a relatively static surface to form 
5 an antigenic determinant of the conformational type. 
However, in certain polypeptides where the amino acid 
residues have collectively sufficient helix-forming 
potential, the polypeptide can adopt a rigid helical 
structure in which the movement of the side chains of 
10 the amino acid residues is relatively spatially 

restricted. In this situation a specific antibody able 
to recognise the polypeptide would only be expected to 
do so when, the helix was in the correct position with 
respect to the combining site of the antibody, that is. 
15 in a position such that those residues along the helix 
surface which collectively have the correct topology and 
charge distribution to form the antigenic determinant 
site are complementary to the combining site of the, 
antibody. Expressed in geometrical terms, combination 
20 only occurs at the correct rotation about and 

translation along the helix axis with respect to the 
antibody. 

Most viruses comprise a core of genetic material 
surrounded by a coat of protein or the like and it is 
25 this coat (or, more precisely, specific regions of the 
molecular structure of this coat) that antibodies 
recognize and neutralize. It is clear that an immune 



" 3 ~ 0114759 

response to a given virus. could be induced by injecting 
into the host a protein corresponding to the native coat 
protein or even a compound who.se surface structure is or 
includes a. section which is essentially the same as ,the 
5 antigenic determinant regions of. that coat protein. ^ 
Where an amino acid sequence - forming part of the . coa*^ 
protein has adopted a helical configuration,, some of the 
side chains on the amino acids making up that sequence 
will be directed into the interior of the protein as 
10 part of the hydrophobic core. ^ It is only ..those side 
chains which are directed outwards and which are . ^ . 
accessible to solvent that can be recognized by an 
antibody and are thus of importance when designing a , 
synthetic compound to .induce the same antigenic response. 

15 ; Synthetic compounds of this type have the potential 
to be of considerable value., especially where diseaae^ 
control is currently -by means of killed or. attenuatjed 
virus vaccines, which can be potentially dangerous. . 
However, most viruses exigt as a number of strains and a 

20 vaccine effective against one strain may be ineffective 
or insufficiently effective against another. 
Accordingly, it would be necessary to manufacture 
separate compounds for each strain of the virus.. EP . . 
44710 discloses a possible solution to this problem by 
linking together in a single chain, which may be 

25 helical, two or more small peptide units, each unit, 
containing a single antigenic determinant region. 



Foot and mouth disease ("FMD"), also known as hoof - - 
and month disease, is a problem of major economic 
importance in many countries. Control of the disease is 
primarily with killed vims . vaccines . A major problem 
5 with the production and application of the vaccine is 
the extreme variation found in the antigenic 
determinants on the FMD virus ("FMDV") particle 
responsible for eliciting an immune response. 

Thus. FMDV exists as seven serotypes comprising more 
10 than 60 strains. Immunity to one serotype does not 
confer protection against the other six. Furthermore, 
control of the disease by vaccination is complicated by 
the extreme antigenic lability of the virus even within 
serotypes. The extent of this variation is such that 
15 vaccination will not necessarily give prolonged 
protection against other strains of the virus even, 
within the same serotype. The antigenic variation: 
' between the serotypes is due to variation in the amino 

acid sequence of the VPl capsid protein (which is the 
20 major coat protein of FMDV and which carries the major 
immunogenic portion of the coat), giving rise to 
structural variations in the sites (epitopes) which 
combine with antibodies. Similar problems exist with 
many other viruses, for example, the polio virus, or the 
25 viruses of influenza, hepatitis, viral hog cholera, 
rabies, smallpox, measles, rubella or mumps. . 
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Two antigenic sites within VPl have been 
recognised. The first is. around residues 138-154 and 
the;second is from 200 to the C terminal. The second is 
probably a "sequential" type of determinant, but the - 
first is "conformational" , and hence more suited for 
.secondary and tertiary structure analysis. In this 
specification, as is conventional in the art. an amino 
acid position in a given sequence is assigned a number 
equating to the number of amino acid residues in the 
sequence, counting from the N terminal to that 
position. In considering synthetic sequences analogous 
to a natural polypeptide or to part of a natural 
polypeptide, it is generally convenient to base the 
numbering system on the natural polypeptide and to use 
that system for numbering the synthetic sequence, even 
though the synthetic sequence may be different from the 
natural sequence. Such differences may be of three 
types: 

(a) an amino acid at position "x" of the natural 
sequence may not be present in the synthetic 
sequence - this is referred to as a "deletion at 
position X" ; 

(b) an amino acid present in the synthetic sequence 
may not have a counterpart in the natural sequence - 
this . is referred to as an "insertion between 
positions x and x 4- 1 " of the natural sequence;^ and 

(c) an amino acid at position x of the natural 
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Beguence may be replaced by a different amino acid - 
this is referred to as a "substitution at position 

In the case of the VPl protein of FMDV, the nuinbering is 
based -upon a sequence identical to that of the VPl 
protein of the O^K vims type, except that the O^K 
sequence has a deletion at position 199* 

I have now discovered that it is. possible to include 
2 or more antigenic determinant regions on a single 
small polypeptide molecule, which may be helical, by 
arranging that the two or more regions are. in part or 
in whole, occupying the same parts of the molecule, e.g. 
the same turns of the helix. The amino acid seguence 
carrying those regions can be relatively small, e.g. 
from 7 to 30. more preferably from 10 to 20 and most 
preferably from 15 to 17. amino acid units long. 

In the present invention, the seguence of amino 
acids which gives rise to one antigenic site is at least 
partially interpolated with other amino acids so that 
the conformation of the molecule provides at least two 
antigenic sites in which the amino acids of each site 
are conf ormationally contiguous, and said other amino 
acids forming at least part of one antigenic site. 



■--Thus, the present invention consists in a molecule 
comprising an amino, acid sequence having at least two 
antigenic regions.' each said region being capable of 
being recognized by antibodies raised against a viral 
.5 protein sequence, each said region being formed of a 
chain Of amino acids of said sequence, the chains 
forming said regions having at lea.st one amino acid 
(preferably more than one amino acid) of said sequence 
in common, so that the chains at least partly overlap 
1.0 and are not contiguous along the length of the 
molecule. The sequence. is preferably such that it 
adopts a helical conformation, however, other 
conformations are possible, for example other sequences 
may form a structure similar to a B-sheet. 

15 : The" degree of overlap between; the two (or more) 
antigenic regions and. hence, the chains forming them 
may vary considerably. Substantial overlap between the' 
• regions is clearly desirable, in order that the sequence 
may beas Short as possible - thereby facilitating 

20 synthesis. For example, overlap such that at least 25% 
Of the amino acids in one of said chains also form part 
of the other of said chains is preferred. An overlap of 
at least SOX is more preferred and a greater degree of 
overlap. .e.g. at least 75%. may be aimed. for. if 
25 possible, in some cases, it may be possible to achieve 
or. at least, approach loo% overlap. 
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The laolecule of the invention is preferably a 
polypeptide - 

The. seqiaence of the invention is preferably coiipled 
to a larger compouna to form a single molecule, in order 
5 ■ to facilitate antibody response, as is well known. 

Examples of such larger compounds include keyhole limpet 
haemocyamin, bovine serum albumin, btg, B-galactosidase 
or penicillinase. The sequence of the invention may be 
covalently attached to the molecule of the larger 
10 compound or. especially where it and the larger compound 
are constructed together by recombinant DNA technology, 
the sequence of the invention may be linked in some„ 
other manner to the larger compound, for example, where 
the larger compound is a polypeptide such as 
15 B-galactosidase or penicillinase, it may be interposed 
in the amino acid chain of that compound. 

The molecules disclosed herein. are characterised by 
having the specificity of two or more viral antigens, 
especially foot and mouth disease viral ("FMDV") 

20 antigens. As will be apparent from the disclosTire to 
follow, the polypeptides, recombinant DNA molecules, and 
processes of this invention may be used in the 
production of antigenic polypeptides useful in 
diagnostic methods and in compositions for reniering 

25 animals resistant to FMDV for some period of time. , 
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Secondary strtJcture analysis of the amino acid 
sequence of VPl protein of FMDV type O^K by the method 
of. for example, Chou and Fasman [Adv. Enzym. 47, 45-148 
(1978)] shows that a large part C144-160) of this 

5 sequence is helical, and furthermore that the 

corresponding sequence in the VPl protein of FKDV type 
C3, which contains several :amino acid substifution? . . 
compared to O^K, is also helical. These sequences- can 
also be analysed to see which groups are contiguous with 

10 the hydrophobic core of the VPl protein. The basis of - 
This method and the rules for one amino acid have-been 
published [Nicholson. Biochem. Soc. Trans.. 10/ 387 
-(1982)]. The method is based on the detailed analysis 
of the side-chain contacts of 24 proteins whose tertiary 

15 structure has been determined by X-ray crystallography, 
and the position of a particular residue with respect to 
the hydrophobic core of the protein correlated with the 
sequence of residues adjacent to it in the primary,; 
sequence, it being known that in nature the primary 

20 sequence determines the secondary and tertiary structure. 

In FMDV type O^^K. all the residues in the core are 
on one side of the helix, except for Val 150. which 
. would require a proximal side chain from elsewhere in 
the molecule to satisfy the core definition, A 
25 noticeable feature of the FMDV type O^^K helix is the 
series of four polar side chains forming a "band" down 
one side which must be just above the surface of the 



contiga36us protein, since the helix residiTes adjacent to 
the polar band residues are in the interior core of the 
rbst of the protein. 

The amino acid snbstitutions in the corresponding 
5* segiaences of the different strains are shoian in Table 1: 
' TABLE 1 



Epicylin 
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Til n 








i r* rr 


T FTT 


^ T* rr 




^ IT 
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Ar g 




Ser 


A TCI 


7i r fT 


c 

V 


civ 




\3 j.y 






7 


Asp 


Asp 


Asp 


Asp ' 


Asp 


8 


; Leu 


Leu 


Leu 


MET 


Leu 


9 


Gin 


Gin 


GlV 


Gly 


ALA: 


150 


Thr ; 


VAL 


Ser 


Thr 


His 




Le^} ■ 


Len 


Leu 


Leu 


. Leu 


2 


. Ala 


Ala- 


Ala 


Ala 


Ala 


3 


Gin 


Gin 


ALA 


ALA 


ALA 


4 


Lys 


Lys 


Ars. 


Arg 


Ala 


5 


His ; 


VAL 


VAL 


VAL 


His 


6 


Ala 


Ala 


Ala 


VAL 


Ala 


7 


Arg 


Arg 


Thr 


LVS 


Arg 


• 8 


His 


Thr 


Gin 


Gin 


His 


, 9 


Leu 


Leu 


Leu 


Leu 


Leu 
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In the above table and throughout this 
specification, the amino acids are identified by the 
internationally recognized standard three-letter amino 
acid code. The ami-no acids are underlined where they 
5 differ from those of Epicylin P/0^K/C3 (which is a 

sequence, in accordance with the present invention) and 
are in capitals where they are predicted to be in the 
protein core. 

Their effect is as follows. In FHDV type C3. the 

10 Gin at 149 and' 153 of O^^K are replaced by Ala. both of 
which are now predicted to be in the core; Lys 154 is 
replaced by Ala (non-core): and some basicity is 
regenerated by the . appearance of three His for two Val 
(150. 155) and Thr 158. Thus in FMDV type C3. and to 

15 some extent in types AlO and A24. the bulKy polar side 
chains of the left hand side have been replaced by small 
hydrophobic side chains which are in the core and at the 
■ same time the right hand side has the two core Val 

replaced by non-core His. Effectively therefore, in the 

20 intact protein VPl the helix viewed from the N-terminal 
has rotated anticlockwise with respect to the core, 
presenting a different topology for the constant 
features relative to the antibody binding sites, in 
addition to the substitutions themselves. The rotation 

25 may well be assisted by the deletion in the 140-145 

range, which might explain why these residues appear to 
be involved in the antigenic response. Thus the 
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antibody combining sites of the two epitopes are related 
by rotation about the helix axis. 

The present invention utilises this discovery in 
developing synthetic polypeptides of appropriate amino 

5 acid sequence such that they form rotationally related 
epitopes which may interact with antibodies raised 
against different strains of virus and on coupling to 
suitable carrier proteins of natural or synthetic 
protein conventionally used for preparing antigens e-g. 

.10 bovine serum albumin, induce antibody formation. 

Also in Table 1. under the heading "Epicylin 
P/0j^K/C3", is shown an amino acid sequence in 
accordance with this invention and this has been found 
to elicit antibodies which neutralize virus type C3. as 
15 well as virus type O^^K, from which its structure is 

derived. The amino acid sequence produced comprised the 
. amino acid residues labelled 144-159. It was 
synthesized by recombinant DNA technology by inserting a 
genetic sequence corresponding to the desired amino acid 
20 sequence between the EcoRl and BamHl sites of a 
. B-galactosidase gene in a modified plasmid vector 
(derived from pBR322) according to conventional 
methodology, and as illustrated subsequently in the 
Example . , 
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Another amino acid sequence. Epicylin AP/A10/A12, 
was produced by chemical synthesis by inserting arginine 
between positions 145 and 146 of a sequence 
corresponding to that of the VPl protein of virus type 
AlO. The method employed was that of Bittle et al. 
[Nature 298.. p 30-33 (1982)']. The structure of this 
Epicylin AP/A10/A12 is shown in Table 2. together, for 
comparison, with the corresponding sequences of VPl 
protein of FMDV types AlO, A12A and A12B. 
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■ . ' ' TABLE 2 
Epicylin . •• 







AP/A10/A12 


Aio 


, A12A . 


A12B 




144 


Arg 


Arg 


Val 


Val 


5 


145 • ■ 


Ser 


Ser 


Arfl , 


■Ar£ 




insert 


Arg 




— — 


^— ' 




146 


Gly 


Gly 


Gly 


. Gly 




147 


Asp 


Asp 


Asp 


Arg 




148 


Leu 


Leu 


Ser . 


Leu 


10 


149 • 


Gly 


Gly . 


Gly 


Gly . 




150. 


Ser 


Ser 


Ser 


Ser 




151 


lie 


lie . > 


Leu 


Leu 




152 


Ala 


Ala 


Ala 


Ala 




153' 


Ala 


Ala 


Leu 


Pro 


15 


154 


Arg 


Arg 


Arg 


Arg 




155 


Val 


Val ' 


Val 


Val 




. 156 . 


Ala -. 


Ala 


Ala 


. : Ala ■ 




157 


Thr ■ 


Thr 


Ar£ 


Arg. 




158 


Gin. 


Gin 


■ Gin 


Gin 


20. 


159 


Leu 


Leu 


Leu 


Leu 



The amino acid sequences shown in Table 2 were 
each coupled to keyhole limpet haemocyanin and their 
antigenicities determined by preparing antisera in 
guinea pigs, which were then assayed for their ability 
to neutralize FMDV by mixing virus with an equal volume 
of normal or anti-serum and . in jecting 0.03 ml of the 
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mixture into groups .of 7 . day old mice, as described in 
FEES Letters/ Volv. 157 No. 2. p 261-264 (1983). 

The results are shown in Table 3, which shows the. 
effectiveness of antibodies elicited by each of the 
5 amino. acid sequences shown in Table 2 against viruses of 
types AlO. A12A and A12B. 

TABLE 3 ' 



Amino Acid 
Sequence 


Virus type 


. AlO 


A12A 


A12B 


AID 


>++ 






A12A 




+++ 


+ 


■ A12B ; 






+++ 


' Epicylin . 








AP/A10/A12 


+++ 




++ 



15 +++ = excellent .neutralization 

>+ s good neutralization 

•f = moderate neutralization 

- = no neutralization 



In Epicylins P/0^K/C3 and AP/A10/A12, it is. of 
20 course, possible to replace one or more of the amino 



acid residiaes thereof by" other' acid residues*;, especially 
where the residues are in the protein *core/dnd thus can 
be expected to have no effect on antigenicity. Thus, 
antigenic equivalents of these sequences also form part 
5 of the present invention. 

Examples of other sequences in accordance with the 
invention and derived from F^5DV VPl protein are given 
below. In these sequences: 

X represents a deletion; 
10 parentheses indicate that the amino acid(s) 

immediately preceding them may optionally be replaced by 
the amino acid(s) within them; 

" + " indicates that the two or more consecutive acids 
joined by signs may together be replaced by the 

15 acid(s).in parentheses immediately following them: 

/..:'•/ indicates the position in a native protein of 
the amino acid to its immediate left. 

• EPICYLIN P/A12/C3 : . : V 

Arg-Gly-Asp-Phe-Gly-Ser-Leu-Pro-Arg/C3-143: A12-154/- 
20 Val(Ala)-Ala-Arg/C3-145;A12-157/-Ala(X)-Gly/C3-146/.Asp- 
Leu-Ala-His-Leu-Ala-Ala-Ala-His-Ala-Arg-His-Leu/C3-159/- 
Pro . 
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■~~ EPICYLIN AP/C3/A12 . - — — ^— - - 

Arg-Arg/C3 -14 5/-Gly -Asp-Leu-Ala -His -Leu-Ala-Ala-Ala -Hi s- 
Ala-Arg-His-Leu/C3-159/-Ala{Asp or Glu5-AsptGlu or Ala)-. 
Phe/A12-14 8/-Gly-Ser-Leu-Ala-Pr.o-Arg-Val-Ala-Arg-Gln(X)- 

5 Leu{X)/A12-159/ 

EPICYLIN AP/A10/C3 - 1 

Leu(Ala)/A10-159/-Gln-Thr-Ala-Arg(Val)-Arg/A10-154: 
C3-145/-Ala{Gly)/A10-146/-Asp-Leu(Ile or Ala)- 
Ser/A10-150/-His/C3-150/-Leu-Glu+Ala (orAla+Glu)- 
10 Ala-His-Ala-Arg-His-Leu(Ala)/C3-159/-Arg(Ala)-Ala(X) 
Alternatively for first six positions: 

Len-Gln-Thr-Arg-Val(Ala)-Arg/A10-154/- 
EPICYLIN AP/A10(A12^/C3 - 2 

This synthetic seq^uence is . terminated at both ends by . 

15 the carboxyl ends of the AlO and C3 sequences. It is 
possible to perform two separate syntheses and link the 
C3 Arg-143 to AlO Gly-146. Alternatively part of AlO 
146 may be synthesised with opposite polarity from 
Gly-146 to Lei3-159. This polypeptide should also have 

20 A12 activity. 



_. 0 114759 

" - Leu(Ala,/A10Ti5y/:Gln-^^^^^ 

Leu.Asp_Giy-Arg/A12-145; C3-143/-Ser (X).Arg/C3-145/-Gly- 

Asp/C3-l47/-Le.:-Ala-His.Lev-Ala.Ala-AU 
Leu/C3-159/ 

^ EPICYLIW P /A10rft17W n^ 

.Arg/A10-l43/*Ser(XH-Arg/C3-143/).Arg/A12-145/-Gly/A10-l46/- 
.^^^.^^^-^^"^^^^^^ 

Ala-Ser/A10.150/.Ala ( Ile).His>AXa-Arg/C3-l57; ,A10-154/- 
His-LeT3(Ala)-Thr-Gln-Ala 

10 EPICYLIW P/^aT vsAT 2 - l - 

Arg/SAT2-145/-Pro-Arg/SATl-i45; SAT2-insert/- G1:./SAT2- " 
insert/-Asp/SATl-l47/-Val(Leu or Ala)/SAT1-148/-Ala- 

Thr/SATl-l50/{Gln/SAT2-150/)-Leu-Ala-Ala-Arg-ile{Tyr)- 
Ala(Set)-Glu(Ser or Ala)-Thr(LyB).His-Ala(Leu or He or 
5 Ser) •■■ ■ ■ ; 

EPICYLIN P/SATl/Sar? - -> 

Pro/SATl-i44/-Arg-Gly-Asp-Le^-Ala-Thr-Leu-Ala.Ala- 
Arg/SATl-154; SAT2-insert/.Arg/SAT2.l45/-( Ile/SATl-146/- ■ 
Ala-Ser-Glia/SATl-insert: SAT2-insert/-Thr(Val)-His(Leu)- 

,Ile/SATl-l59/(Ala).Gln/SAT2.l50/-Lys-Tyr/SAT2-152/-Ser- 
Ser-Ala-Lys-His-Ser-Leu(Ala)/SAT2-159/ 



EP.ICYLIN P/SATl/O ^fC ■ 

Pro/SATl-.144/(Leu/0j^^-l44/)-Arg-Gly/0^-146/-Asp- 

LeT3(Ala)-Gln-Thr(Val)-Leu-Ala-Gln-Arg(Lys)-Jle-Ala- 
Arg-Thr+Glu(Glu+Thr)-His/SATl-i58/-Leu . 

5 EPICYLIN AP/SATl/Q ^K 

Ala-Arg/0^-l4 5/-Glu(Thr/SATl-157/)-Asp(GlTi)-Ser-GlD- 
Val/o^K-l50(Ile/SATl-155/)-Arg-Ala-Gln-Lys /O K-\ 
154/ (Leu or Ala ) /-Thr-Ala-Arg/0^K-lS7/ (Lys ) -Asp/SATl- 
147/(Glv' or Tlir)-Gly-Arg-Pro 

10 EPICYLIN AP/O ^ K/A10(A12^ 

Leu-Arg-Gly/Oj^K-14 6/-Asp-Le\3-Gln-Val-Leu-Ala-Gln-Lys- 
Val-Ala-Arg/O^K-157: A10-154./-Thr-Ala/A10-153/-Ala (X)- 
Ile{Ala)-Ser-Gly-Leu-Asp-Gly-Ser+Arg{Arg + Ser + Arg) 

EPICYLIN P/A10fA12^/0 K : 

15 Arg/AlO-143: Oj^K-145/+Ser+Arg(Arg + Ser + X)-Gly- 
Asp/Alo-147; 0^K-147/-Leu(Ala or Ile)-GlD-Tlir- 

Va 1 ( Set J -Leu ( 1 1 e ) -Al a-G Iji-Ar g ( Ly s ) -Va 1 ( Ala ) -Ala -Ar g -Gl n 
Ala (Leu) 

Other sites from the 130-144 region may be built in e.g 
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EPICYLIN P/A12/0 ., C (BFS) 

Arg-Gly-Asp-Phe-Gly-Ser-Leu-Ala-Pro-Arg/b^C-135/-yal- 
Ala-Arg/A12-157/-Gly(X)-Asn(Gln)/0^C-139/-Ala-Ser-Gly 

EPICYLIN P/SAT2/A10 

5 Ile-Arg-Gly-Asp/SAT2-147/-Arg-Gni-Val-Leu/SAT2-148/-Ala- 
Asn/A10-13 4/(Gln/SAT2-150/)-Lys/SAT2-151/-Tyr-Ser- ^ 
Ala ( Ser ) -Ser CAla ) -Asp ( Lys ) -Ser CHis ) -Arg-Ser-Pro 

Sequences are also constructed to include another 
antigenic region (200-213) of the viral capsid. protein 
10 . e.g. : 

EPICYLIN P/C3/0 j^ 

C3 residues 144-153 and then Ala-His-Ala-Arg/C3-157 : 
■ 0^-201/-His/C3-158; 01-202/-Lys-Gln-Lys-Ile-Val-' 
Ala-Pro/0^-209/ 

15 EPICYLIN P/AlO/O j^K 

Arg+Ser+Arg(Arg+Ser+X)-Gly-Asp-Leu-Gly-Ser-Ile-Ala- 

Ala-Arg/AlO-154: 0^K-201/-His (Tyr or Val)-Lys-Thr- 
Gln/A10-158 : 0^K-204/-Lys-Ile-Val-Ala-Pro 
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EPICYLIN P/SAT2/0 j^K 

Arg/SATZ-l^SZ-Gly-Asp-Arg-Gl-G-Val-Leu-Ala-Gln-Lys-Tyr-Ser 
Ser-Ala-Lys/SAT2"156/"His-Ser/SAT2"158/(Gln)-Lys-Ile-Val- 
Ala-Pro/01-209/ 

5 Although this invention is described principally 

vith reference to FMDV it niay be of use for other 
viruses, e.g. polio virus, whose antigenic sites may be 
represented as overlapping sites on the surface of a 
helix. The advantage of the method is that a single 
10 polypeptide only need be synthesised instead of the tvo 
or more normally required, to produce the antibody 
. response to two or more antigenically distinct strains. 
The invention has been shown to be of value with the 
antigenic regions of the capsid proteins of 
15 picornaviruses (of which FMDV and polio virus are 

examples). It may also be applied to such other viruses 
as those, of influenza, hepatitis, viral hog cholera, 
rabies, measles, rubella, mumps or (should it ever again 
become a problem) smallpox . 

Polypeptides or other compounds comprising the amino 
acid sequences of the invention may be formulated in a 
conventional manner with carriers or adjuvants and are 
preferably formulated for injection, although other 
routes of administration are possible. 
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A further advantage of the invention in common with 

all synthetic polypeptide vaccines is that contamination 

• . ' ^ - • ■ ■ ■ ■ ■ ' ' ' ■ • . - .■: 

with potentially . inf ective material is avoided. 

Another advantage is that, it is relatively easy to 
5 clone a synthetic DNA segiaence corresponding to the 

amino acid sequence of the invention into a vector such 
as pBR322, since this plasmid readily accepts short 
lengths of DNA. and gro^7th in £. coli HBlOl or similar 
would, allow expression of the polypeptide, which could 
10 be further enhanced by inclusion of promoters in the DNA 
sequence and other published procedures. 

Since the polypeptide is small, and helical 

structures may readily be disrupted and reformed in 

vitro , spontaneous formation of the helical secondary 
15 structure is. assured. 

The polypeptide can be prepared according to any 
conventional method of synthesising peptides. Either 
solid phase processes or liquid phase processes may be 
employed, with the latter being advantageous in many 

20 cases. Such processes for synthesising peptides are 

described in. for example, "The Peptides", vol. 1 (1966) 
written by Schroder and Luhke (Academic Press, New York, 
U.S.A.) or "Peptide Synthesis" written by Izumiya et 
al . . Maruzen Co. Ltd. (1975), and there are illustrated. 

25 for example, an azide proc ss, a chloride process, an 
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acid anhydride process, a .mixed -acid anhydride process, 
a DCC process, an active ester ■ process , a process ^sing 
Woodward reagent K, a carbodiimidazole process, an 
oxidation-reduction process, a DCC/additive (e.g. HONB, 
HOBt, HOSu, etc.) process, and the like. 

In particular, the polypeptide or its amino acid 
sequence can easily be synthesised by conventional 
processes for synthesising polypeptides especially the 
so-called stepwise process in which amino acids are 
condensed one by one onto a terminal amino acid, or by a 
process of coupling several fragments. More 
specifically, the above-described peptide can be 
prepared by condensing a reactive carboxyl group- 
containing starting mater ial corresponding to one of the 
two fragments formed by separating the peptide at an 
arbitrary bond position with another reactive amino 
group-containing starting material corresponding to the 
other fragment in a manner conventionally em^ployed for 
the synthesis of peptides and, where the resulting 
condensate contains a protective group or groups, 
removing the protective group{s) in a conventional 
manner. 

The synthetic peptides of this invention may be used 
for treatment, prevention or diagnosis of viral 
infections of mammals including man. Vaccines of the 
peptides may be formulated and used in a conventional 
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_ roanner (see for example European Patent Publication No. 
0 090 581 and U.K. Patent Specification No. 1 083 815). 

Advantageously/ the peptides are administered 
sub-cutaneously or intrainuscularly at doses of between 1 
5 to 100 microgram per kilogram body veight of the mammal, 
preferably from 10 to 50 microgram/kilogram. Maximum 
protection is likely to be afforded by a plurality of 
administered doses spaced from 2 to 8 weeks apart. 

The invention is illustrated by the following 
10 Example, which should not be construed as a limitation 
thereof, and which illustrates the preparation of 
Epicylin P/0^K/C3. 

EXAMPLE 

i. Construction of Plasmid pXY413 . 

15 The methods used for. plasmid construction and 

analysis are described by Maniatis et al . [Molecular , 
Cloning, A laboratory manual. Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York (1982)]. 
Techniques useful for the analysis of B-galactosidase 

20 are described by J. Miller [Experiments in molecular 
genetics. Cold Spring Harbor Laboratory. Cold Spring 
Harbor. New York (1972) ] . 
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Plasmid pXY413, an open reading frame vector, was 
constructed as shown in Figure 1 of the accompanying 
drawings. The three starting plasmids were pEA300 
obtained from Dr. Egon Aman (Harvard University. 
5 Cambridge MA, U.S.A.). pMC1403 [M. Casadaban et al . . J. 
Bact. 143 . 971 (1980)]. and pOP203-13 obtained from Dr. 
F. Fuller (Harvard University. Cambridge MA, U.S.A. ) 

The lac genetic elements of plasmid pOP203-13. were, 
recloned from the original pBR322 replicon onto a pAT153 

10 replicon .and renamed pXYl. This; plasmid contains the 
lac promoter and operator elements and the first eight 
amino acids of fi-galactosidase. The -35 region of the. 
trp promoter was provided by pEA300 and the hybrid tac 
promoter (pxy620) was constructed as shown in Figure 

15 lA. To enhance its usefulness in subsequent plasmid 
constructions . the tac promoter and associated eight 
amino acids of fl-galactosidase were recloned onto pXy621 
and subsequently pxy623 as shown in Figure IB. . 

The plasmid pxy623 was then cut with PstI and EcoRI 
20 and separated by agarose gel electrophoresis. A 1000 
base pair piece of DNA was isolated which contained the 
tac promoter ribosome binding site and start codon. 
This was ligated into plasmid pMC1403 which had also 
been cut with PstI and EcoRI. The resulting plasmid 
25 pXY413 contained the tac promoter and ribosome binding 
site, eight amino acids of fl-galactosidase, a polylinker 



region, the enzyma tically _active carbox'y terminal 1015 
a'niino acids of B-galactosidase, and a B-lactamase gene 
, and origin of replication for plasmid selection and 
maintenance (Fig. IC) . 

Plasmid pXY413 does not direct the production of 
large quantities of B-galactosidase under tac promoter 
control as the ribosomes which initiate translation at 
the ATG codon immediately downstream of the ribosoroe 
binding site are not in the same reading frame as the 
structural gene for JS-galactosidase. When plasmid 
pxy413 is transformed into a lac Z host (such as JK103 
or JM105) the colonies are white on X-gal indicator 
plates indicating little or no B-galactosidase' activity 
is present . . 

As a test system for expression, the BamHI site of 
pxy^is was filled in with DNA poll' (Klenow fragment) to 
adjust the reading frame and allow B-galactosidase 
production from the* tac promoter. An example of such a 
construction. pxy415. when transformed into JM105 
produces large amounts of B-galactosidase and gives blue 
colonies on X-gal plates. This is an important feature 
of pxy413 in that desired recombinants may be easily 
selected for' by identifying blue' colonies on X-gal . 
indicator plates . * 
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2. Constri3Ction of Plasmid pWRL240 „ ^ 

A synthetic gene coding for a hybrid FMDV VPl 
antigenic determinant (serotypes and C3) - this is 
the material which I have named Epicylin P/O^^K/CS - . 

5 was assembled from six oligonucleotides. N1-N6 (Figure 
2). cloned into pUC9 (Figure 4). After confirming that 
the resulting plasmid pWRL220 contained . the desired 
sequence (Figure 3).. the synthetic gene was removed by 
cutting the plasmid with EcoRI and BamHI and ligating ' . 

10 the resulting 68 base pair fragment into the open 

reading frame vector pxy413 to construct plasmid pWRL240. 
as. shown in Figure 5. 

.3. Expression and antigenic activity of hybrid protein 

When transformed into JM105. pWRL240 directed the 
15. synthesis of high levels of a hybrid B-galactosidase 
gene and gave blue colonies on X-gal indicator plates. 
The hybrid protein has 19 amino acids of VPl fused to 
the amino-terminus of B-galactosidase (Figure 6). 

The hybrid protein produced by pWRL240 had the same 
20 specific activity as normal B-galactosidase and can be 
purified by standard techniques (Miller, op. cit.) or 
affinity chromatography with antibodies to 
fl-galactosidase. Small amounts of the hybrid protein 
were purified by affinity chromatography and injected 
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into Guinea Pigs together with adjuvant for testing of 
immunogenic activity/ 
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CLAIMS; 

1. A molecyle comprising an amino acid sequence having 
at least two antigenic regions, each said region being . 
capable of being recognized by antibodies raised against 
a viral protein sequence, each said region being formed 
of a chain of amino acids of said sequence, the chains 
forming said regions having at least one amino acid of 
said sequence in common, so that the chains at least - 
partly overlap an^ are not. contiguous along thelength 
of the molecule. 



2. A molecule according to Claim i. in which the ^ 
sequence is such that it adopts a helical conformation. 

3. ' A molecule according to Claim 1 , or Claim .2. which is 
a polypeptide. 

4. A molecule according to any one of the preceding 
Claims, in which said seguence is coupled to a larger 
compound to form a single molecule. 

5. A molecule according to Claim 4. in which said 
larger compo.und is keyhole limpet haemocyamin. bovine 
serum albumin, btg. B-galactosidase or penicillinase. 



6. A molecule according to any one of the preceding 

Cv^ aims, in which said overlap is such that at least 25* 
of the amino acids in one of said chains form part of 
the other said chain. 

7. A molecule according to Claim &. in which at least 
50% of the amino acids in said one chain form part of 

'said other chain. " " ; . 

8. A molecule according to Claim 7. in which at least 
75% of the amino acids in said one chain form part of 
said other chain. 

9. . A molecule according to any one of the preceding 
Claims, in which said segueiice is such that it adopts a 
helical conformation. 

10. A molecule .according to Claim 1. in which said - 
sequence comprises the amino acids: Arg-Arg-Gly-Asp- 
Leu-Gln-Thr-Leu-Ala-Gln-Lys-His-Aia-Arg-His-Leu or an 

antigenic equivalent thereof. 

11. A molecule according to Claim 1. in which said 
sequence comprises the amino acids:" Arg-Ser-Arg-Gly- 
Asp-Leu-Gly-Ser-Ile-Ala-Ala-Arg-Val-Ala-Thr-Gln-Leu or 

an antigenic equivalent thereof.. 



\ ■■ ; • . _3i .^\l^T5_d 

. ^^L ^."^^^^^ °f the -preceding 

Claims, having the antigenicity of two or more .different 
strains or subtypes of foot and mouth disease virus. 

13. A molecule according to Claim li. in which said 
strains are *0^K and C3. 

14. A molecule according to Claim 12. in which said 
strains are AlO and A12 . 

15. A vaccine comprising a molecule according to any one 
of the preceding Claims together with a carrier or 

. adjuvant. 

16. A synthetic peptide comprising a sequence of amino 
acids providing at least two antigenic sites, the amino 
acid sequence giving rise to a first antigenic site 
being interpolated with other amino acids forming at. 

■least part pf a second antigenic site so that the amino 
acids of each site are conformationally contiguous and 
the peptide is capable of being recognised by two 

. antisera separately raised against different viral 
strains . ■ -, 

17. A synthetic peptide as claimed in Claim 16. wherein 
the interpolation Of amino acid comprises the sharing of 
one amino acid by the two antigenic sites. 
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18. A vaccine comprising a peptide as defined in Claim 
16 or 17 together with an adjuvant or- carrier therefor. 

19. ' A vaccine as claimed in Claim 18. vherein the 
carrier is covalently linked to the peptide.. - 



20. A vaccine- according to any one of Claims 15 to 19. 
formiilated for injection. , 



« » < 
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coRI 

Ptac EcoRI 




Sal I 



FIG.1B. 



deleted 
EcoR I site 



1) partial cut with EcoR ] 

2) fill inwith DHA pol I 
(Klenow) 

3) cut with Sal I 
Misolate 897 bp fragment 
5 ) ligation 
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Pst I 



Sail 



1) cut with EcoR I 
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FIG. 2. 

Oligonucleotides N1 - W6 

- 5' 3' length 

FMOV - Nl AATTCATCCCGAGAC6T66T6A 22 

- N2 CCT6CA6ACTCT66CTCAGAAAC 23 
. - N3 AT6CTC6TCACCTCCC6A6C6BB 23 

■ - Hk GATCCCCGCTCGBGABBT6 19 

- N5 ACGABCATGTTTCTGA6CCAGAG . 23 

- N6 TCTBCAG6TCACCAC6TCTCBGBAT6 26 
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FIG. 3. : 
Synthetic DHA for Peptide (P/DiK/Csl 

EcoRI BamHI 
i fJl 22 N2 23 N3 23 ^ 



N6 26 N5 23 19 

{Glul Phe He Pro Arg Arg Gly Asp 

5' AATTC; ATC CCG AGA CBT GGT GAC 

3'6 TAG G6C TCT GCA CCA CT6 

EcoRI Aval 



Thr 


Leu 


AU 


61 n 


Lys 


His 


ACT 


CT.6 


6CT 


CAG 


AAA 


CAT 


TGA 


GAC 


C6A 


GTC 


TTT 


"GTA 



— V — 

PslI 
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Leu 


Pro 


Ser 


GLy 


(Asp) (Pro) 




GCT 


CGT 


CAC 


CTC 


CCG 


AGC 


GGG 




3l 


CGA 


GCA 


BTG 


GA,G 


BGC 


TC,6 


CCC 


CTA G ^ 












Aval 
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F I G . 4 . Construction of a gene for an antigenic determinant 

(Epicylin P/0,K/C3) 




1) Cut with EcoRI and BamHI 

2) Isolate 2-6 Kb Backbone 



3) ligate with oligonucleotides 
N1-NB 

V ■ ■ . ■ 
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FIG.5 . Construction of pWRL2C0 




Fie.6, 

pWRL240 produces a 1DC8 
amino acid hybrid protein 



lac RBS [lac V 

Met-Thr-MeMle-Thr-Asp . 
CACACAB6AMCA6CTAT6 ACC ATG ATT AC6 BAT , , 

......] LSpacer.,.....] [antigenic 

; Ser-Leu-6lu-Phe-Ile-Pro-Arg-Arg 
TDA CTG 6AA TTC ATC CC6 A6A CGT 



determinant........ 

Gly-Asp-Leu-Bln-Thr-Leu-Ala-Gln-Lys-His 
GGT 6AC. CTG CAG ACT CTG GCT GAG AAA CAT 



[spacer....... .. .3 

Al a-A rg-H i s-Leu-Pr o-S e r-6 1 y-A sp-P ro 
GCT CGT CAC CTG CC6 ABC GG6 GAT CCC 



[ ..3-gelactosidase...] 

9 10 1023 
Val Val ... Lys STOP 
BTC 6TT ... AAA TAA 



